Extracting activist events from news articles using existing NLP tools and services

نویسنده

  • Thomas Ploeger
چکیده

Activists have a significant role in shaping social views and opinions. To find out specifically how they are doing this, social scientists study the events activists are involved in. Unfortunately, individual sources may present incomplete, incorrect, or biased event descriptions. We present a method where we automatically extract event mentions from di↵erent news sources that could complement, contradict, or verify each other. The extraction will ultimately feed a visual analytics application that supports discovery and analysis in large amounts of event data. This document is the full thesis, also attached is a scientific paper that describes the essence of the thesis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CRONE: A tool for investigating the reliability of activist events Thesis Design: Calculating Reliability for Online News Events

Recent developments in the domain of Information Retrieval triggered a revolution in social sciences. NLP and the semantic web provide methods to study various social phenomena from a crowd angle, instead of a domain expert angle. Extracting meaningful entities from raw text and aggregate the information gathered provides a basis for all kinds of visualizations. The MONA project is such a proje...

متن کامل

Extracting and Displaying Temporal Entities from Historical Articles

Our goal is to extract and display temporal entities in textual documents. The task involves identification of all events in a document followed by identification of important events using a classifier. We present a user with the key events and their associated people, places, and organizations within a document in terms of a timeline and a map. For purposes of testing, we use historical Wikipe...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

Extracting Named Entities Using Named Entity Recognizer and Generating Topics Using Latent Dirichlet Allocation Algorithm for Arabic News Articles

This paper explains for the Arabic language, how to extract named entities and topics from news articles. Due to the lack of high quality tools for Named Entity Recognition (NER) and topic identification for Arabic, we have built an Arabic NER (RenA) and an Arabic topic extraction tool using the popular LDA algorithm (ALDA). NER involves extracting information and identifying types, such as nam...

متن کامل

Unsupervised Storyline Extraction from News Articles

Storyline extraction from news streams aims to extract events under a certain news topic and reveal how those events evolve over time. It requires algorithms capable of accurately extracting events from news articles published in different time periods and linking these extracted events into coherent stories. The two tasks are often solved separately, which might suffer from the problem of erro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013